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Abstract 

We study the estimation of moments and joint moments of microstructure noise. 
Estimators of arbitrary order of (joint) moments are provided, for which we establish 
consistency as well as central limit theorems. In particular, we provide estimators of 
auto-covariances and auto-correlations of the noise. Simulation studies demonstrate 
excellent performance of our estimators even in the presence of jumps and irregular 
observation times. Empirical studies reveal (moderate) positive auto-correlation of the 
noise for the stocks tested. 



Keywords: market microstructure noise, high frequency data, joint moments, auto-covariance, 
auto-correlation 



1 Introduction 



It has long been recognized that market microstructure noise plays a significant role in fi- 
nancial markets. See, for example, the seminal paper of Black (1986) and comprehensive 



reviews of [Madhayanj (|2000j) , |Q'Hara| fl2003[ ) , |Stoll| (|2003j and |Hasbrouck| {[2007J) , among oth- 
ers. The market microstructure noise is induced by various frictions in the trading process. 
Examples of such frictions include bid-ask spread, asymmetric information of traders, the 
discreteness of price change, etc. 

With the increasing availability of high frequency data, the market microstructure noise 
has received growing attention. Despite the small size, market microstructure noise accumu- 
lates at high frequency and affects badly the inferences about the efficient price processes, 
such as the estimation of volatilities. No-arbitrage based arguments (see, for example, 



Delbaen and Schachermayer (1994)) suggests that the (efficient) price processes should nor- 



mally be semimartingales. The fundamental properties of semimartingales allow to make 
accurate inferences about volatilities and other quantities with high frequency observations. 
See, for example, Jacod and Protter (1998), Mykland and Zhang (2006), among others. 



However, for liquidly traded securities, empirical evidence such as the signature plots of 



Andersen, Bollerslev, Diebold and Labys (2000) show clear noise accumulation effect at 
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high frequency. Therefore, more recent research carefully analyzes both components of the 
market price processes: the latent semimartingale price process and the noise process. 

Several methods to de-noise the data in the context of volatility estimation have been 
proposed. For example, the two-scale method as in Zhang et al. (20051, Ai't-Sahalia et 
al\ (2005); the kernel method as in Barndorff-Nielsen et al. (2008), Barndorff-Nielsen et 



al. 



(2011); the pre-averaging method as in Jacod et al. (2009), Kinnebrock et al. (2010); 



the multi-scale method as in Zhang (2006); the quasi-maximum likelihood method as in 



Xiu (2010), Ait-Sahalia et al. (2010), among others. These methods are shown to be very 



effective when the noise is an additive white noise, or presents some kind of independence 
between successive observation times. Gloter and Jacod (2001), Li and Mykland (2007) 



and Rosenbaum (2009) studied the case when the noise is of specific form such as round- 



off errors or round-off errors on top of additive white noise. On the other hand, Hansen 



and Lunde (2006) and Ukabata and Oya (2009) have shown evidence of dependence of the 



noise in financial markets. When there is autocorrelation in the data, one possible way to 
reduce the impact of dependence is to use subsampling and averaging. |Hansen and Lunde 



(2006), Barndorff-Nielsen et al. (2008) and A'it-Sahalia et al. (2011) provide estimators when 



the noise satisfy certain weak dependence assumptions. However, the optimal subsampling 
scheme and de-noise method depend on the dependence structure. Hence understanding the 
dependence structure of the noise is essential for inferences. 

Can we understand better the statistical properties of the noise? Specifically, for a 
particular security price process, how is the noise distributed and what is the dependence 
structure? 

In this article we study how to estimate the moments and joint moments of the noise, 
based on high-frequency data. More specifically, under both settings where the observation 
times are equally spaced or are irregularly spaced, we propose estimators for (joint) moments 
of arbitrary orders of the noise. We establish consistency as well as central limit theorems 
for our estimators under certain mild mixing conditions on the noise (see Assumptions (NO- 
1) and (NO-2) below for precise statements). As is well known that under appropriate 
conditions on the tail any distribution can be fully reconstructed from its moments, our 
results allow one to understand the marginal distribution as well as the joint distributions 
of the noise. 

Simple applications of our results include estimating the auto-covariances and auto- 
correlations of the noise. And as central limit theorems are available, one can readily build 
tests for testing, for example, whether auto-correlations of particular orders vanish or not. 

The paper is organized as follows. Section [2] introduces the setting and assumptions. 
Section [3] presents the consistency and asymptotic normality results of our proposed estima- 
tors for the (joint) moments of the noise. Section [^demonstrates our results via simulations. 
Empirical studies are carried out in Section [5] in which we show by estimation and hypothe- 
sis testing that the noises are (moderately) positively auto-correlated for the stocks tested. 
Section [6] concludes. The proofs are given in the Appendix [A| 
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2 Setting and assumptions 



In this paper, we have three basic ingredients. The first one is the underlying process X, 
typically the log-price of an asset; the second one is the observation scheme, the third one 
is the noise. 

The assumptions on X are the standard ones in this kind of problem, namely is an 
ltd semimartingale, possibly discontinuous, plus some mild additional assumptions : so X 
is defined on some filtered probability space (ft, J 7 , (J t )(>o,P), and it admits the following 
Grigelionis representation: 

X t = X + f b s ds+ [ a s dW s + (51 m < 1} ) * - a)t + (Sl m>1} ) (2.1) 
Jo Jo 

In this formula, W is a standard Brownian motion, and j? is a Poisson random measure on 
R + x E, where (E, £) is a Polish space, with a non-random intensity measure of the form 
g (dt, dz) = dt <S> \(dz) with A a a-finite measure on (E,£). The above is the general form 
of an ltd semimartingale, and we assume the following on the optional coefficients b and a 
and the predictable coefficient 5 = 5(oj, t, z): 

Assumption (H): The process b is locally bounded, the process a is cadlag, and there is 
a localizing sequence (r n ) of stopping times and, for each n, a deterministic nonnegative 
function J n on E satisfying J J n (z) 2 \(dz) < oo and such that \5(lo, t, z)\ A 1 < J n (z) for all 
(u,t, z) with t < T n (oS). 

Next, we describe how observations take place. At stage n, that is for a given frequency 
of observations, the successive observations occur at times = T(n, 0) < T(n, 1) < • • • , for a 
sequence T(n, i) of (possibly random) finite times increasing to oo as i — > oo, so the number 
of observations up to time t is N n (t) + 1, where N n (t) = sup(i : T(n, i) < t). The minimal 
assumption on the observation times is that each T(n, i) is a stopping time, and the mesh 
goes to in a sense specified later, as n — > oo. Moreover, at time T(n, i) the process X is 
contaminated by some noise, meaning that we observe the variable 

Y i U = X T(n,i) + £ f, ( 2 - 2 ) 

where the noise is ef . 

For the sake of motivation about our forthcoming assumptions, we (temporarily) suppose 
that the noise is independent of X, centered, stationary, and with a negative exponential 
covariance. This covers a whole range of "natural" situations, the two extreme ones being 
as follows: 

1) Conditionally on the observation times, the covariance between e" and ■ is ae^ a '^ T ^ n,%+ ^' 
so the exponential covariance is in terms of calendar time and does not depend on the ob- 
servation scheme. 

2) The covariance between e™ and ef + j is ae~ a i: so the covariance between two values of 
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the noise depends only on how many observations (or, transactions) occurred in between 
the corresponding times. 



And, of course, there are mid-term possibilities, like the covariance being ae a ]Un with a 
"scaling" sequence u n going to slower than (T(n,i + 1) — T(n,i)). 

In the first situation above it is of course impossible to obtain consistent estimators for 
the characteristics of the noise, such as the covariance function (in the negative exponential 
well as in a completely general case), unless the horizon T goes to infinity. Such a 



setting has been studied, see e.g. Ukabata and Oya (2009), and here we are interested in the 



case where the horizon T is fixed. In the second extremal situation, and in all intermediate 
cases, it is in principle possible to consistently estimate the characteristics of the noise, 
under appropriate assumptions of course. The extreme case 2 above is obviously simpler 
than the intermediate cases, and should already provide useful insight, so below we focus 
on the second extremal case. 

Now, it is well known that, even in the absence of noise, analysis of the underlying 
process such as the estimation of the volatility is much easier when observation times are 
equally spaced, that is T(n, i) = iA n for a sequence of non-random numbers A n going to 0. 
And, in this case, when there is noise we can relax somehow the independence assumption 
between X and the noise. On the other hand, when the noise is indeed independent of X, 
the statistical analysis of the noise does not require equally spaced observations: this is 
especially interesting when observation times coincide with transaction times, those being 
of course not equally spaced (and the extremal case described above is then rather well 
suited to real problems). 

This is why, below, we consider two different sets of assumptions, which combine hy- 
potheses on the observation scheme and on the structure of the noise. 

Before stating these assumptions, and for completeness, we recall the p-mixing property 
of a stationary sequence (Xi)ieZ of variables, indexed by Z: letting Qj = o~(xi '■ i < j) and 
Q 3 = °~{Xi '■ * ^ i) De the pre- and post-cr-fields at time j, the p- mixing coefficients of x f° r 
k > 1 are 

Pk(x) = sup(\E(UV)\ : E(U)=E(V) = 0,E(U 2 )<1,E(V 2 )<1, 

U is C?o- measurarj le, V is C/ fc -measurable) , 

and we say that x is f-polynomially /3-mixing if Pk(x) < K/k v , where v is a number bigger 
than 1. Then, the two sets of assumptions are as follows, and both of them make use of a 
non-random sequence of positive numbers A n going to as n — > oo. 



Assumption (NO-1): For all T > we have 

the sequences sup i>1 (T(n, i) A T — T(n, i — 1) A T) , . 

and A n A r n (T)are bounded in probability. 

The noise (ef)j>o can be realized as e" = Xii where (x«)iez is a stationary, centered process, 
independent of the cr-field = \/t>oJ~t, and with finite moments of all orders, and which 
is u-polynomially p-mixing for some v > 1. 
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Assumption (NO-2): We have T(n,i) = iA n (regular observation scheme), and the noise 
(sf )j>o can be realized as 

= lT(n,i) -Xi, (2-5) 

where 7 is a nonnegative ltd semimartingale on (O, J 7 , (J-t)t>o, IP), which satisfies Assumption 



(H) (with of course different coefficients than in (2.1)), and (Xi)ieZ i s as m (NO-1) 



Remark 2.1 These assumptions could be weakened by asking finite moments up to a suit- 
able order only: for example, if one is interested in estimating the covariance function of the 
process x> we only need finite moments up to order q, bigger than but arbitrarily close to 4. 

The p-mixing condition could also be replaced by a-mixing or ^-mixing, or by any other 
condition implying ergodicity and a central limit theorem for all functionals of the type 
Eti f(Xi) when E(/( X0 )) = and E(|/( Xo )| 9 ) < 00 for all q > 0. □ 

Remark 2.2 Under Assumption (NO-2) the noise is not really independent of X, a form of 
dependency being induced by the presence of the process 7. However (NO-2) and a fortiori 
(NO-1) imply that the noise and the returns of X are not correlated: this is of course a 
drawback of the model used here. □ 

Remark 2.3 It should be noted that our model does not provide a definition of noise which 
is "consistent" with a change of observation times, in the following sense: when T(n, i) = 
i/n with n even, and when we subsample and take only the observations at times 2i/n 



(this amounts to replacing n by n/2), then in (2.5) we have to replace the process x by a 
new process \\ = \2i- This new process shares the same mixing properties as x, but the 
covariance is modified in a trivial way. □ 



3 Estimation of the moments of the noise 

We will be interested in estimating the various moments of the noise. For this, we introduce 
some general notation: let J be the set of all finite sequences of relative integers j = 
0l)j2) - "" Jq) (they are neither necessarily ordered, nor necessarily distinct, and q > 1), 
and we use the notation 



(jl; ' ' " 1 j<j)> j' — (fit ' ' ' ! j'qi 



<?(j) = Q, = m ax(ji, • • • ,j q ) 

- < jej' = (ji,--- JqJ'v- ,f q >) (3.1) 



, j+ m = {m + 31, ■ ■ ■ , ™ + j q ) if m G Z. 

We introduce the subset J + of J consisting of all j = (ji, J2, ■ • • ,jq) with j r > for all 
r = 1, • • • , q, and l 7 0+ is the set of all j = (ji,j2, • • • > jq) £ J + such that j\ = and q > 2. 
Associated with each j G J , we introduce the integer composite moments of the noise x 

as 

1 

m) =R(h,--- j q ) = E (n^>) if j = 0ii--- Ji)- ( 3 - 2 ) 

r=l 

Note that R(j) = when q(j) = 1, and R(j) = R(j +m ) for all m £ Z, so we restrict our 
attention to the estimation of R(j) when j € l 7 0+ . The covariance of \ ls > r (j) = R(®J)i 
the variance is i?(0, 0). 
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3.1 Consistency Results. 

For estimating we first choose a sequence k n > 2 of integ ers which satisfies, with A n 
as in pUj ): 

1 

fc n -»• oo, /c n A^ -> 0, for some fi 6 (0, -). (3.3) 

Then we set 

Kn j=0 Kn i=0 n i=0 n j=0 

(3.4) 

and for j = (ji, • • • , j g ) and /i = /i(j), consider the processes 

N n {t)+l-fJ,-2qhn q 

U^t = £ U&kr ~ F ^ + (2r-l) fe J- (3-5) 

i=0 r=l 

The index for Y n above is chosen to ensure that the noise components in YQ.^ and in 
Y™ + p+ (2r- l)jfc are se P ara ted by at least k n indices, implying that they are "independent 
enough". The sum above, as everywhere else below, is set to be when the upper limit is 
smaller than the lower limit, that is N n (t) < \i + 2qk n — 1, but for any t > this is not the 
case when n is large enough. The upper limit of the sum above is such that U(j)t uses only 
data within the time interval [0, t], and all these data. 
The consistency results are as follows. 



Theorem 3.1 Assume (H) and ( Sjty . Let j G J + and T > 0. 
(a) Under (NO-1) we have 



N n (T) 

(b) Under (NO-2) we have 



^ n Um A R(i) f T 7 f)ds. (3.7) 
Jo 

When both (NO-1) and (NO-2) hold, so jt = 1, (a) is a special case of (b). Also, under 
(NO-2), there is a fundamental non-identifiability, namely we can divide 7 by a number 
a > 0, and multiply % by the same a: this explains the form of the limit in (3.7), and in 
this case there is of course no way to estimate R(j) any better than up to a multiplicative 
constant. 

In the next subsection we will state Central Limit Theorems associated with these conver- 
gences. They involve some limiting variances-covariances, based on the following quantities, 
where j, j' G J + : 
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and we will see in the proofs below that these are finite numbers, and if J§ is a finite subset 
of J + the matrix (S J,J )jj'gj- is a covariance matrix. 

In order to have "feasible" CLTs we need consistent estimators for in case of (NO-1), 
and for T?^ ^,|0)+9(J ) ^ g j n cage Q f (NO-2). The previous theorem gives us such consistent 
estimators in case of (NO-1), and consistent estimators for R(j © j+ m ) Jq 1s^ +9 ^ ^ ds, but 
not for R(j)R(j') Jq^Ts ^ ds. For estimating the latter quantity, we do as follows. If 

j = (ji, • • • ,jq) and j' = • • • ,j' ql ) are in J+, with \i = and fj," = fi + and 
q" = q + q', we set 



N n {t)+l-ii"-{2q"+l)k n q q' 

^ II ffi+3r ~ YT i+li+{2r-l)k n ) H (*H-|K+(2g+l)fcn+# _ ^T+/y'+(2r+2,7)fc n , 

i=0 r=l r=l 

(3.9) 

Then we have: 



Theorem 3.2 Assume (H), (NO-2) and \3.3\ , and let'},}' G J + . If k n satisfies j[3.3\) anc 
T > we have 



A„i7a J')t A i?(j)E(jO / 7l (j)+9(j,) ^- 

•/ 



(3.10) 



We have a similar result under (NO-1), but this is not needed below. Coming back to 
the covariances S J,J , we have: 



Corollary 3.3 Assume (H) and §3jty . Let},}' G J+ and T > 0. 
fa,) Under (NO-1) and if k' n satisfies 

k n y oo, k n < /c n , A; n \Jk n Ajj ^ 0, 



(3.11) 



S JJ , where 



L-lrp 



M (tr(j © )')t + 2 £ U((j ej'+JSj - -^y 2 Um U(j%. (3.12) 



N n (T) 



771=1 



f&J Under (NO-2) and if k n satisfies (3.3) and k' n satisfies (3.11) we have ' n 
^'Jo7f )+g{i ' ] ds, where 



n 

E* J '' n = A n (t/(j©j')?+X;(^((j©j , f m )T+tr((j +m ©j / )?))-(2^ + l)A„I7(j,j')^. (3.13) 



m=l 



3.2 Central Limit Theorems. 



Suppose for instance that (NO-1) holds; by Theorem 3.1 it looks like U(j)j,/N n (T) properly 
estimates R(j), but it turns out, rather, that it is a good estimator for the following: 



9(J) 

R(k n ;i)=m(H(x jr 



t7" 

A /J (j) + (2r-l)fc r 



(3.14) 
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and there is a CLT with this centering term under exactly the same assumptions than in this 
theorem (and similarly when (NO-2) holds). As far as consistency is concerned this is not 
a problem because R(k n ;j) —> R(j), as we will show below. For the CLT with the desired 
centering i?(j), though, we need the convergence R(k n ;j) — > R(j) to be faster than the rate 

1/2 

of convergence, namely A„ , in the CLT. This will be the case if k n goes fast enough to oo, 



and more precisely if, instead of (3.3), we have the following, which is stronger than (3.3), 



and where v is the mixing exponent: 

k n A\[ 2v -> oo, k n A 9 n -> for some 0e (_,_). ( 3 - 15 ) 

Below, we state two different theorems, under (NO-1) and (NO-2) respectively, and J7b is 
any finite subset of J + . We also recall that a sequence of variables U n on (f2,.F,P), taking 
their values in some Polish space E, is said to converge T^-stably in law to a limit U defined 
on an extension (O, T, P) of the original space if, for any continuous bounded function / on 
E and any bounded J^-measurable variable we have E( X I / f(U n )) — > f(U)). 



Theorem 3.4 Assume (H), (NO-1) and (3.15). For any fixed T > the -valued random 
variables ZJ^ = (Z^' J )j g j- with components 

Hp = V / ^n(T)( ]v; ^ ) U(i)$-R(jj) (3.16) 

converge F^-stably in law to a centered Gaussian -valued variable defined on an extension 
(f2,.F, P) of the space, independent of J^, and whose covariance matrix is S JJ , as defined 



by (3.8). 



In this result, similar with (a) of Theorem 3.1 and in contrast with the next result to 
come, we do not have the functional convergence (as processes), and the limit is not even 
depending on T. 

Note also that we could as well consider the whole (countable) family J + instead of a 
finite subset J7b, if we consider the product topology on IR 1 . The same comment applies to 
the forthcoming result as well, but we do not need this kind of generality in this paper. 



Theorem 3.5 Assume (H) and (NO-2) and (3.3). For any fixed T > the R Jo -valued 
random variables Zip = (Z!p ,;> )j & j with components 

Z'^ = -J= (A n 170)$ " m) f 7l a) da) (3.17) 

converge F^-stably in law to a variable Z' T = (Z / (j)r)j g j 07 defined on an extension (Cl,J-, P) 
of the space, which, conditionally on Too, is centered Gaussian with (conditional) covariance 

E(Z!>Z'> | J^) = £•>••>' / 7 fCi)+?(i) ds. (3.18) 

Jo 



8 



These results, joint with Corollary |3.3[ also give us feasible CLTs, in the following sense: 
suppose for simplicity that J7q = {j} is a singleton. Then with notation (3.13), we have 



A n U(i)%-R(i) 
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!<J) ds 



AA(0,1) 



(3.19) 



under the assumptions of Theorem 3.5 (we even have the -stable convergence in law 



above). This is due, by standard properties of the stable convergence in law, to the fact 
the limit in probability of S^' n is an ^^-measurable variable. In the same way, under the 
assumptions of Theorem |3.4[ 



<N n (T) 

Z-Jrp 



N n (T) 



AA(0,1) 



(3.20) 



Remark 3.6 The choice of k n in (3.15) requires the knowledge of v, or at least of the fact 
that v is bigger than some known value v' > 1: in this case one may take k n X 1/ Al/ 2v . 
This is unfortunate, since in general one does not a priori know the law of Xi an d in 
particular whether it is stationary, or mixing, not to mention the number v for which it 
is f-polynomially /3-mixing. Nevertheless, nothing can be done without assumptions, and 
assuming that the unknown v is bigger than some fixed v' > 1 seems reasonably weak. In 
practice, one can choose k n in an ad-hoc manner: first pick a preliminary k n and check 
how fast the estimated correlations decay. If the estimated correlations decay fast, then 
one can possibly switch to a smaller k n , otherwise one may increase k n . One can also get 
some guidance from simulation studies, by coining a time series whose auto-correlations has 
similar behavior to what is observed in the real data, and then choosing different values 
of k n in the simulation study to see which values of k n work better. 



3.3 Estimation of the Covariance and the Correlation under (NO-1). 



In this subsection we assume (NO-1) and of course (H). A natural estimator for the covari- 
ance r(j) for any given j > is as follows (the time horizon T is fixed): we choose two 
sequences k n and k' n satisfying (3.15) and ( 3.11| ), respectively, and set 

r(j)n = U(0,j)%. (3.21) 

These estimators are consistent for estimating r(j), and enjoy a Central Limit Theorem with 
rate 1/ \J 'N n (T) and an asymptotic variance which is consistently estimated by 



T 



N n (T) 



U(0,j,0J)T + 2 U(0,j,m,j + mm - (2k' n + l)(f(j) n 



m=l 



Then we can rewrite (3.20) in this special case as 



IN n (T) 



(r(j)n-r(j)) A AA(0,1), 



(3.22) 



T 
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and, recalling that N n (T) is obviously known to the statistician, it is straightforward to 
construct confidence intervals for any r(j). 



Remark 3.7 (3.22) is not the end of the story about covariance estimation. Since Theorem 



|3.4| is a multivariate result, it is no problem to find confidence bounds for any finite family 
(r(0),r(l), • • • ,r(j)), although the formulation becomes messier when j increases. □ 

Remark 3.8 The choice of both sequences k n and k f n is connected with the numbers A n 
in (NO-1), which are a kind of mesh sizes, and also with the number v. The - annoying - 



connection with v has been discussed in Remark 3J) The connection with A n is even more 
annoying, in a sense: under (NO-1), these numbers A n are unknown (or, unobservable) , 
although they are supposed to exist. 

However, although we do not develop this topic here in a formal way, it can be shown 
that good proxies for A n are the observable numbers T/N n (T). Indeed, we can replace 



(3.15) and (3.11) by 



k n X iV n (Tf, k' n ^N n {Tf\ where 



2^ <??< 2' 



rf < 



1 — 7/ 



A 7]. 



(3.23) 



Then, although k n and k' n are now random, all the previous results still hold, if (NO-1) 
holds for a possibly unknown sequence A n — > 0: this is due to the fact that, since we are 
here analyzing the noise with the structure e" = Xh the calendar time is relatively of little 
importance in comparison with the index i enumerating the observations themselves. □ 

Remark 3.9 If one does as suggested in the previous remark, one is still left with the 
important problem of choosing the tuning parameters rj and rj' , and also the proper pro- 
portionality constants. This is exactly as in all statistical problems for which one uses local 
windows. □ 



Now, we turn to the estimation of the correlation between Xi anci Xi+j> f° r a fixed j > 1, 
that is 

r(j) 



Cor(j) 



r(0)' 



Using (3.21), natural (and consistent) estimators for this are 

r(j)n 



Cor(j) n 



r(0) r 



(3.24) 



(3.25) 



The associated CLT is a straightforward conseq uence of Theorem 3.4 used with j = (0, 0) 



and j' = (0, j). Namely, under (H) and (NO-1) and (3.15) for k n , N n (T) (Cor(j)„-Cor(j)) 



converges J^-stably in law to a variable which is J 7 00 -conditionally centered Gaussian with 
(conditional) variance 



S{j) 



r(0) 2 £( JM -0 + r (j) 2 £(°> M >°) - 2r(0)r(j) £( > )>(M 

r(0) 4 ' 
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With the notation ( |3.12[ ), consistent estimators for S(j) are 



1 'mi zp jU0 > j) ' n + ml ^ mm ' n - 2?{j) n ? ( ) n s T 



(0,0),(OJ),m 



At this stage, the following result is obvious: 



Theorem 3.10 Assume (H) and (NO-1), and let k n satisfy (3.15) and k' n satisfy (3.11). 
Then 



\h^n (Cor(i) n - Cor(i)) -4^(0,1). 
V 6 U)t 



(3.26) 



3.4 Estimation of the Covariance and the Correlation under (NO-2). 

From now on, we suppose that we are under (NO-2), with regularly spaced observations and 
the additional process 7. As mentioned before, we cannot estimate the covariance r(j), but 
we can estimate the "integrated covariance", which is 



K{j)t = r(j) I Hds. 

'0 



(3.27) 



Or, perhaps, one would like to estimate the "averaged" observed covariance lZ{j)x/T (this 
is of course the same problem), or the "spot" covariance r{j)^ at some time t within [0, T]. 

Despite its interest, we will not speak about spot covariance here, since this is somewhat 
similar (because 7 is supposed to be a semimartingale) to the estimation of the spot volatility. 
The estimation of TZ(j)x here is exactly the same problem as the estimation of r(J) under 



(NO-1): we choose two sequences k n and k' n satisfying (3.15) and (3.11) respectively, and a 
natural sequence of consistent estimators is given by 



K(j) n T = A n U(0,j)^. 



(3.28) 



The rate of convergence in the CLT is now y/ A n and, recalling the notation (3.9), the 
asymptotic variance is consistently estimated by 



%'(3)t = An (17(0, j, 0,j)t + 2 E U(0,j,m,j + m)?) - (2k' n + l)A n U((0,j), (0, 



m=l 



Then we can rewrite (3.19) as 

1 



A„S'(j)5 



(jim - nu) T ) 



JV(0,1). 



(3.29) 



More int eresti ng perhaps, in this case, is the estimation of the correlation Cor(j), which 



is given by (3.24) but also satisfies Cor(j) = for all t > (recall that 7 > 0). Thus, 



consistent estimators for this are 



(3.30) 
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Again, A— (Cor (j)S, — Cor(j)) converges J^-stably in law to a variable which conditionally 
on J-'oo is centered normal with variance 5'(j)t given by 



TZ(0) 2 T £(°.i)>(°J) + £(o,o),(o,o) - 2n(0) T n(j) T 



5'(j)t = ^^7^1 V ;i lUs 



T 



K(0) 4 T 

which is also equal to S(j) Jq jg ds/ ( 7^ ds) 2 . Then, consistent estimators for S'{j)T are 



(7£(0)j,) 



and we have the following: 



Theorem 3.11 Assume (H) and (NO-2), and let k n satisfy (3.15) and k' n satisfy (3.11) 
and T > 0. Then 



A„«S'(j)£ 



(Cor (j)t - Cor(j)) A AA(0,1). 



(3.31) 



4 Simulations 



Throughout the following two sections we take T = 1, in other words, we concentrate on 
intraday data. 



4.1 Under Assumption (NO-1) 

We consider the following design: X is an Ornstein-Uhlenbeck process with jumps 

dX t = -p(X t -fi) + adW t + dJ t , (4.32) 

where Wt is a standard Brownian motion, and Jt is a compound Poisson process independent 
of W as follows 

N t 
i=0 

where Nt is a Poisson process with rate A, and D^s are i.i.d. symmetric mixed normals: 
Di = Bi- Z'-, where Bi takes values 1 and —1 with equal probability 0.5, and Z'- ~ N(/j,', cr' 2 ). 
The observation times T(n, i) are specified as a Poisson process with rate n, and x 1S an 
AR(1) process: 

Xi+l = 4>Xi + Z i+ i, where Z i ~ iid N{Q,al). 

The observations are Y™ = Xx( n ,i) + e f = ^T(n,i) + Xi- 
The specification of parameters is as follows: 

p = 0.5, fi = 0.002, cj = 0.01, A = 3, // = ct/10, a' = a/30, 
= 0.8, (7o = 0.0003, and n = 93,600. 
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In the estimation, we choose k n = 8 and k' n = 4. Figure [T] compares the estimates of 
auto-covariances and auto-correlations based on Theorems 13.11 and 13.41 with the infeasible 
estimates based on the noise process and the theoretical values. The estimates are based on 
one simulated path. More specifically, in Figure [T] the dashed red curves report r(j) n as in 



(3.21) on the left and Cor(j) n as in (3.25) on the right; the dotted blue curves report the 
auto-covariance on the left and the auto-correlation on the right based on the simulated x\ 
the solid black curves report the theoretical values, i.e., 



r(J) 



and Cor(j) 



for j £ Z. 



(4.34) 



Auto-Covariances of Noise 



Auto-Correlations of Noise 



■ - Feasible estimates based on Y 

■ ■ Infeasible estimates b 
Theoretical Values 




— Feasible estimates based on Y 
■ ■ ■ ■ Infeasible estimates based on noise 
Theoretical Values 




Figure 1: Comparison of estimates of auto-covariances and auto-correlations. 

Figure [T] demonstrates that under Assumption (NO-1), our estimates are comparable to 
the infeasible estimates based on the noise process, which is the best one can hope for; both 
are almost indistinguishable from the theoretical values. 

Next we demonstrate the central limit theorems Theorem 13.41 and Theorem 13.101 We 
plot the normal quantile-quantile plots of 



as in (3.20) for j = 0, ■ 



lN n (T) 



, 8 in Figure |2l and 



Nn(T) 



(Cor(j)n - Cor(i)) 



as in (3.26) for j = 1, • • • ,9 in Figure |5J based on 1,000 replications. All the plots support 
that the normality established in the theorems can be relied on in practice with sample 
observed at a reasonably high frequency within the time period being considered. 
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Normal 0-0 Plot of Estimates of Oth Order Auto-Covariance 



Normal 0-0 Plot of Estimates of 1st Order Auto-Covar iance 



Normal 0-0 Plot of Estimates of 2nd Order Auto-Covariance 



-3-2-10 1 2 3 

Theoretical Quantiles 



-10 12 3 -3-2-10123 

Theoretical Quanliles Theoretical Quantiles 



Normal Q-0 Plot of Estimates of 3rd Order Auto-Covariance Normal Q-Q Plot of Estimates of 4 th Order Auto-Covar lances Normal Q-0 Plot of Estimates of 5 th Order Auto -Co variances 




Theoretical Quantile 



Theoretical Quantiles 



Normal 0-0 Plot of Estimates of 6 th Order Auto-Covariances Normal 0-0 Plot of Estimates of 7 th Order Auto-Covariances Normal 0-0 Plot of Estimates of 8 th Order Auto-Covar iances 

S M - 




-10 12 3 -3-2-10123 

TheoreticalQuantiles TheoreticalQuantiles 



Figure 2: QQ-plots of estimates of auto-covariances of orders from to 8, based on 1,000 
replications 



Normal 0-0 Plot of Estimates of 1 st Order Auto-Correlation Normal 0-0 Plot of Estimates of 2nd Order Auto-Correlation Normal 0-0 Plot of Estimates of 3rd Order Auto-Correlation 



Theoretical Quantiles 



I hecelical Quantiles 



Theoretical Guanti os 



Normal Q-Q Plot of Estimates of 4 th Order Auto-Correlation Normal Q-0 Plot of Estimates of 5 th Order Auto-Correlation Normal Q-Q Plot of Estimates of 6 th Order Auto-Correlation 




-3-2-10123 -3-2-10123 
TheoreticalQuantiles TheoreticalQuantiles 

Normal Q-Q Plot of Estimates of 7 th Order Auto-Correlation Normal Q-Q Plot of Estimates of 8 th Order Auto-Correlation Normal 0-0 Plot of Estimates of 9 th Order Auto-Correlation 

I « - 



-3-2-10 1 2 3 

Theoretical Quantiles 



-10 12 3 -3-2-10123 

Theoretical Quantiles Theoretical Quantiles 



Figure 3: QQ-plots of estimates of auto-correlations of orders from 1 to 9, based on 1,000 
replications 
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4.2 Under Assumption (NO-2) 



The X process is taken to be the same as in (4.32) above, namely, an Ornstein-Uhlenbeck 



process with jumps. Under Assumption (NO-2), we have an additional process 7, which we 
assume to be an Ornstein-Uhlenbeck process 



•P-yilt ~ M7) + °7 dW t , 



where Wt is the same Brownian motion that is used in (4.32). \ ls again an AR(1) process. 
The observations are Y/ 1 



T(n,i) 



X 



T(n,i) + lT(n,i)Xi- 



Note that in this case the 



and x are the same as in (4.33) 
We further take A„ 



0.01. 



The 
1/n := 



noise is dependent on the X process. 

The parameters for X, i.e., p, fi,a, X, fj,' ,cr' , 
parameters for 7 are p 7 = 0.5, /i™ = 1 and cr, 
1/93,600. 

In the estimation, we choose k n = 8 and k' n = 4. Figure [4] compares the feasible estimates 
with the infeasible estimates and the theoretical values. The estimates are again based on 
one simulated path. More specifically, on the left panel, we use red d ashe d curve to report the 
feasible estimates of the (scaled) auto-covariances based on Theorem 3.1 (lZ(j) n as in ( 3.28| )); 
blue dotted curve to report the infeasible estimates based on the noise process sf = ^iA„Xi 

(i.e., the auto-covariance based on the simulated x multiplied by j} 7I ds = A n X^ILo Tik„) ! 
black solid curve to report the theoretical values, i.e., TZ(j) as in (3.27). On t he right panel, 

(Cor(j) n as in 



we compare the feasible estimates of auto-correlations based on Theorem 



3.5 



(3.30); see red dashed curve) with the infeasible estimates based on the noise process (auto- 
correlations based on the simulated x; see blue dotted curve), and the theoretical values 



(Cor(j) as in (4.34); see black solid curve). 



Auto-Covariances of Noise Scaled by Gamma 



Auto-Correlations of Noise 





Figure 4: Comparison of estimates of auto-covariances and auto-correlations. 

From Figure [4] we see again that our estimates are comparable to the infeasible estimates 
based on the noise process, and both are very close to the theoretical values. 

Next we demonstrate the central limit theorems Theorem 13.51 and Theorem 13.111 The 
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normal quantile to quantile plots of 

J==(A n U(j)%-R(j) [ T n®d8 
vJ.J' n v Jo 



as in (3.19 ) for j = 0, - ■ ■ ,8 are plotted in Figure [5] and the normal quantile to quantile plots 
of 

' (cSr'(j)? - Cor(j)) 



A„«S'(j) 



T 



as in (3.31 ) for j = 1, • • • , 9 are plotted in Figure 6 based on 1,000 replications. Again, the 



practical applicability of the established normality is strongly supported. 



Normal Q-0 Plot of Estimates of 0th Order Auto-Covariance Normal O-O Plot of Estimates of 1st Order Aulo-Covariance Normal Q-O Plot of Estimates ol 2nd Order Auto-Covariance 



Normal 0-0 Plot of Estimates of 3rd Order Auto-Covariance Normal 0-0 Plot of Estimates of 4 th Order Auto-Covariances Normal 0-0 Plot of Estimates of 5 th Order Auto-Covariances 



Normal 0-0 Plot of Estimates of 6 th Order Auto-Covariances Normal 0-0 Plot of Estimates of 7 th Order Auto-Covariai 



Normal 0-0 Plot of Estimates of 8 th Order Auto-Covariai 



Figure 5: QQ-plots of estimates of auto-covariances of orders from to 8, based on 1,000 
replications 
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Normal O-Q Plot of Estimates of 1st Order Auto-Correlation 



Normal 0-0 Plot of Estimates of 2nd Order Auto -Correlation 



Normal O-O Plot of Estimates of 3rd Order Auto- Correlation 




Figure 6: QQ-plots of estimates of auto-correlations of orders from 1 to 9, based on 1,000 
replications 



5 Empirical Studies 

In this section we examine the dependence of the microstructure noise for several finan- 
cial stocks, in particular, we estimate the auto-covariances and auto-correlations and test 



whether they are equal to zero, based on Theorems 3.4 and 3.10 



5.1 Citi Jan 2011 Data 

We first analyze the tick-by-tick trade data of Citigroup Inc. (NYSE: C) in Jan 2011. The 
average observation frequency is about 246,000 per day (T = 1). The observation times 
are irregular (not equidistant). We assume that the Assumption (NO-1) is satisfied. We 
estimate both the auto-c ovaria nces and auto-correlations of orders to 30, using r(j) n as in 



(3.21) and Cor(j) n as in (3.25) (with k n = 8), for each of the 20 trading days and plot them 



in Figure [7] Each curve in Figure [7] represents one day. 
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Auto-Covar iances of Noise for Citi in Jan 201 1 (20 trading days) 




5 10 15 20 25 30 

lag 



Auto-Correlations of Noise for Citi in Jan 201 1 (20 trading days) 




Figure 7: Estimates of autocovariances and auto-correlations for the tick-by-tick trade data 
of Citigroup in Jan 2011. Each curve is for one trading day, and plots the estimates of 
auto-covariances (upper) or auto-correlations (lower) of orders through 30. 



The auto-correlations appears to decay in an exponential way, so we can assume that 
the mixing condition that we put in Assumption (NO-1) is satisfied. We see from the results 
that the noise is not un-autocorrelated; in fact, positively autocorrelated for all the days 
under study, at least for small lags. 

Based on Theorem |3.4| we can further test whether the auto-covariances are equal to 
zero. More specifically, under the null hypothesis 



#0 : r(j) = 0, for j > 1, 



with j = (0, j), we have by Theorem 3.4 (and (3.20)) that 



lN n (T) 



y\J>J> 



N n (T) 



AT(0,1) 
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So we can compute the p- value for testing Hq : R(j) = as P(JZ| > W gy, n jy (T) ^(j)rj j 

where Z ~ ^(0,1). For this dataset, we take k n = 8 and /c^ = 4 in estimating T^ ,n . 
The p-values turn out to be all close to (most are extremely small, the biggest one is 
about 0.035), for all orders up to 30 and for all the 20 trading days under consideration. In 
particular, since all the estimated auto-covariances are positive, the results also imply that 
if one conducts a one-sided test 



H : r(j) < 0, for j > 1, 



then one rejects these hypotheses at 0.05 significance level for all orders up to 30 for the 
data under study. We hence conclude that the auto-covariances are statistically significantly 
different from 0, and actually, statistically significantly bigger than 0, for all orders up to 30 
and for all the 20 trading days under consideration 

Based on Theorem 13.101 we can conduct tests for the auto-correlations. The results turn 
out to be the same as above, namely, the auto-correlations are statistically significantly 
different from 0, and actually, statistically significantly bigger than 0, for all orders up to 30 
and for all the 20 trading days under consideration. 

Theorems 13.41 and 13.101 also allow us to build confidence bands for the auto-covariances 



and auto-correlations. More specifically, based on Theorem 3.4 we can build 95% confidence 
bands for the auto-covariances r(j) as 



± 1.96 



NJT) 



0,1,2,. 



(5.35) 



And similarly Theorem 3.10 yields 95% confidence bands for the auto-correlations Cor(j) as 



Cor(j)n ± 1-96 • 



N n (T) 



J = 1,2, 



(5.36) 



Applying these formulae to the data on January 3, 2011 we then get the confidence bands 
for the auto-covariances and auto-correlations of the noise, which we plot in Figure [8] below. 
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Figure 8: Confidence bands for the auto-covariances and auto-correlations for the tick-by- 
tick trade data of Citigroup on January 3, 2011. 

We can then conclude that, for example, on January 3, 2011 the auto-correlation of the 
noise of any order up to 5 is greater than 0.15 or so, with 95% confidence. 



5.2 Sprint-Nextel Jan 2011 Data 

We next examine the tick-by-tick trade data of Sprint-Nextel Corporation (NYSE:S) in 
Jan 2011. The average observation frequency is about 55,000 per day (T = 1). Assuming 
that the Assumption (NO-1) is satisfied, we estimate both the auto-co varian ces and auto- 
correlations of orders to 30, using r(j) n as in (3.21) and Cor(j')„ as in (3.25)(with k n = 8), 
for each of the 20 trading days and plot them in Figure [9] Again, each curve in Figure [9] 
represents one day. 
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Auto-Covariances of Noise for Sprint-Nextel in Jan 201 1 (20 trading days) 




Auto-Correlations of Noise for Sprint-Nextel in Jan 201 1 (20 trading days) 




5 10 15 20 25 30 



Figure 9: Estimates of auto-covariances (upper) and auto-correlations (lower) for the Sprint- 
Nextel trade data in Jan 2011. Each curve is for one trading day, and plots the estimates 
of auto-covariances or auto-correlations of orders through 30. 



We see similar phenomena as above, namely, (1) the auto-correlations decay fairly 
quickly, and (2) that the noise are not un-cor related; in fact, positively correlated for all the 
days under study. 

One can also conduct tests as in the previous subsection. The test results are similar: 
for testing either the auto-covariances or auto-correlations equal zero, the p— values are all 
extremely small (all smaller than 10 -5 in this case), and hence one can again conclude that 
the auto-covariances/auto-correlations are statistically significantly different from 0, and 
actually, statistically significantly bigger than 0, for all orders up to 30 and for all the 20 
trading days under consideration. 

We can further construct confidence bands for the auto-covariances and auto-correlations, 
using the formulae (5.35) and (5.36), just as in the previous subsection. The resulting bands 
are plotted as follows, again for the day of January 3, 2011: 
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Confidence bands lor Ihe auto-couariances ol Noise lor Sotint-Nettel on Jan 3,2011 



ConliOence bands lorlne auto-correlations ol Noise for Sprint-Nextel on Jan 3, 2011 



Figure 10: Confidence bands for the auto-covariances and auto-correlations for the tick-by- 
tick trade data of Sprint-Nextel on January 3, 2011. 



Based on Figure 10 we can conclude that on January 3, 2011 the auto-correlation of the 
noise of any order up to 5 is greater than 0.25 or so, with 95% confidence. 



6 Conclusion and discussions 

In this paper we study the estimation of the (joint) moments, in particular, the auto- 
covariances/auto-correlations of the microstructure noise, based on high frequency data. 
We establish consistency as well as central-limit theorems for our proposed estimators. Sim- 
ulation studies demonstrate that our estimators perform well. Empirical studies are also 
carried out, in which by estimation and hypothesis testing that for the stocks tested, the 
microstructure noises are not uncorrelated, but are actually (moderately) positively corre- 
lated. 

When the noises have general auto-correlations, the existing theory based on i.i.d. noises 



or noises of other simple specific forms has to be modified. In Jacod et al. (2013), the authors 



study how the noise structure affects the estimation of volatility, and propose a volatility 
estimator under Assumption (NO-2). Much more has to be done to better understand the 
impact of the dependence structure of the market microstructure noise to further financial 
applications. 



A Proofs 

Before starting the proof we mention that, by using a classical localization procedure, for 
proving all the previously stated results, we can replace the three assumptions (H), (N0-1) 
and (NO-2) by the following stronger assumptions: 



Assumption (SH) We have (H), the processes b and a are bounded, and the stopping time 
T\ is identically infinite (so |<5(u;, t, z)\ A 1 < J(z), where J = Ji). 
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Assumption (SNO-1) Wehave(NO-l) and T(n, i)-T(n, < AA n and A n N n (t) < A t 
for all t, for some constants A and At- 

Assumption (SNO-2) We have (NO-2) and the process 7 satisfies (SH) and is bounded. 

We always assume these strengthened assumptions below, mostly without special men- 
tion. We also always assume, without mention, that (xi)iez is as m (NO-1). In all the 
sequel, the constant K may vary from line to line, but does not depend on n and on the 
various indices i,j,---. 

Whether the noise and the underlying process X are a priori defined on the same space 
or not is irrelevant for the results. However, for the proofs it is convenient to suppose that X 
and 7 are defined (and satisfy the relevant assumptions) on a space (O^ \ (^-j)^>Q , p(°J ) s 
whereas the sequence (xi)iez is defined on another space (f^ 1 ), G, (Gi)iei,, IP*- 1 )), with Gi = 
°~(Xk '■ j < i) an d G l = o~(xk '■ J ^ *)) an d we set 

n = n^xn w , t = t 00 ^g, p = p (0) ®p (1) . 

As usual, any variable or process or <r-field on Qv) for j = 1,2 is also considered, with the 
same notation, as defined on the product f2. 

The space (ftW, jW, {Gi) ie zM l) ) is naturally endowed with a measure-preserving and 
invertible transformation such that Xi+j = Xi° 9 J f° r au h3 £ ^1 an d $ is ergodic by 
the p-mixing property. Let us also recall a consequence of the definition of the mixing 
coefficients pj, and of the product structure of the space (Q, J 7 , P). We set Hi = J-00 ® Gi- If 
£ is a centered square-integrable variable on (OW, (Gi)ieZ:^ )> which is measurable 
with respect to G° = o~(xi : i > 0), we have 

E(|E(eo^ \Hi)\ 2 ) < P }m 2 ) < — (a.i) 

This yields the following useful estimate: if £ is ^-measurable and £' is £/ J+:, -measurable, 



both square-integrable, by (2.3) applied to U = (£ o 6 1 — i?(£))/-y/var(£) and U' = (£' o 
-< - E(g))/y/vai(£'), plus p k (x) < K/k v and E(U) = E(U') = and by the stationarity, 
we have 



I m O I = HO E (0 + E(0 E(U') Vvar(f) + E(0 E(C7) ^var(0 
+ E([/C/ / ) ^var(£) var(f) | 

< I E(0 E(0 1 + ^ VE(£ 2 )E(f2). ( A.2) 

Finally, recall that, if V is a semimar tingale on (J2(°), (J"t) 

t>o?P^) which satisfies 
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(SH), then for any finite stopping time S we have for q > 2 and t > 0: 



E(V s+t -Vs\Fs)\< Kt, E( sup \V S+S - V s \ q \ T s ) < K q t. (A.3) 

■ se[o,t] 



A.l Some Result about Stationary Processes. 

In this subsection we consider a sequence £ n = (^ riJ )i<j< ( i of d- dimensional variables on 
the space J 7 ^ ,¥^), satisfying the following, where w n and w are integers with w n > 

w > 0: 



C, E(e) = 0, sup neN E(||rin <oo Vp>0 



£ n is measurable with respect to the <7-field Q° n £7 Wn = <x(Xi : < i < w„) 



(A.4) 



£ is measurable with respect to the d-field Q° ' C\Q W = <r(xi : < i < w). 
Note that E(£) = 0, whereas w n — > w is noi assumed. We write £™ = £ n o 0* and £j = £ o 9 l . 



Then ( |A.1| ) and ( |A.2| ) yield 
E(||E(C +J |^ m )|| 2 ) = 



E(||E(c +J |a m )|| 2 ) < jns(||H 2 )A* < K/i^ 



neJe m k +l )\ < mu n \\ 2 )/((i - o v ir < - «g v i)* 



(A.5) 



if i > 1, and the same for £ m +j with w; n replaced by w in the second inequality. Since v > 1 
we deduce that the following define a covariance matrix: 



(A.6) 



i=l 



In the simple situation where £ n = £ and u; n = (so £ is a function of xq)> a trivial 



multi-dimensional extension of Corollary VIII. 3. 106 of Jacod and Shiryaev (2003) yields a 
Central Limit Theorem which says that 



[tu n ] 



B. 



(A.7) 



for any sequence u n — > oo, where 

a d-dimensional Brownian motion with covariance E(5| S*) = a J& . (A. 8) 



We need to extend this result when £ n depends on n, subject to (A.4), and w n > 0; this 



leads us to consider the following processes, where as above u n is a sequence tending to oo 
and u' n > is another sequence of integers such that u' n /u n — > 0: 



<j<d, 



where G^ 3 



1 



[Unt]— u' n 



i=0 



(A.9) 
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This will accommodate Theorem 3.4 whereas for Theorem |3.5| we additionally have random 
weights. In this case the observations are equally spaced, and we need a normalization 
connected with the "calendar" time. So we set, with aj,>fla sequence of integers such that 
u' n A n -> 0, 



[*/ A «]-< 

H n = (H n j)x<j<d, where = £ V L 



(A.10) 



8=0 



where V = (V J )i<j<d is a d-dimensional bounded Ito semimartingale satisfying (SH) on 



Theorem A.l (a) Under (A. 4) and (NO-1), and if u n — > oo and w\ju n — > and u' n /u n 



, for any t > the variables Gf converge in law to M(0, at), with the matrix a defined by 



(A.6). 



(b) Under (A. 4) and (NO-2), and if w^A n — > and u' n A n — > 0, for any t > the 
variables converge F^-stably in law to a variable Ht defined on an extension (f2, J 7 , P) of 
the space, and which conditionally on J 7 ^ is centered Gaussian with (conditional) covariance 



VI V? ds. 



(A.ll) 



A way of realizing the limit H above is to take B as in (A. 8) and independent of Too, 
and to put 



V2dB{. 



(A.12) 



Proof. 1) We start with (b), which is more complicated than (a). By (A. 5 ) and the Cauchy- 



Schwarz inequality, Ei>l E (ll E (£" I ^ 

m )||) < oo for any m, whereas V is bounded, so the 
following (i-dimensional variables and are well defined, componentwise: 



Tjn,j _ 



and we write MJ^ for the same variables as M^, with £ n substituted with ^. Since £P is 
Ui +Wn -measurable, we have E(£f | H[ t / An ]+w n -u' n +i) = £? when « < [t/A n ] - u' n , hence 



H t - M \t/A n ]+w n -u' n +l + U ~ U \t/A n ]+w n -u> n +l 



(A.13) 



2) In this step we prove that, for any t > 0, we have 



V, 



[t/A»]+u>„-<+l 



0, and U r Q 



(A.14) 
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We shall only prove the first convergence as the second can be proved similarly. To this 
end, we write m n = [i/A n ] + w n — u' n + 1, and since u' n A n — > and t > we may assume 
m n > w n . Then we have U£ = A n + B n , where 

oo m n 
i=l+ m „ i=m n —w n 



Since V is bounded, we deduce from (A. 5) and Cauchy-Schwarz inequality that 



E(||A„|| 2 ) < KA n Yl E (H E (£P I WrOII ll E (^i I HrrJW) < KA r 

i,j=l+m n 



hence A n — > 0. Next, we have E(||£™|| 2 ) < K by (A. 4), hence E(||l? n || 2 ) is obviously smaller 
than KA n w n because the sum defining B n contains w n terms. Since A n w n — > we deduce 
B n — > 0, hence the first convergence in (A.14). 

3) In this step we prove 



\ lvl [t/A n ]+w n -u' n +l IV1 [t/A n ]+w n -u' n +l\ 



0. 



(A.15) 



Setting = £™ — £i and again m n = [t/A n ] + w n — u' n + 1, we observe that M^ n — M™ n 
HT=iVk^ where 



j>0 



is a martingale increment, relative to the discrete time filtration (Hk)k>0i hence E((M, 



M 



YlT^i^ii^k ) )• successive conditioning, 



£r=i E (« J ) 2 ) = a„e( ET=i E M > ni„ (ECC I «*) E (C J I **) 

-E^'lm-OE^'lHfe-i)) 

and the double series ^ ■ z inside the expectation above is absolutely convergent (almost 
surely). Hence we may permute the order of summation over (i, I) and over k, and thus get 
E((M^ - M^) 2 ) = D n mn - Dq, where 

di = a„e( y, vL vL n nc ,j I «*) E (C J I «/ 

i,Z>0 
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We have = Ylr=i D(r) k , wnere (with an empty sum set to 0) 



mr k 
mi 
mi 
mi 
mi 
mi 



^(p^ k+1 {v^j(nc 3 \n k )) 2 

^{zo^-^iv^jHm?' 3 1 n k )f) 

2A n E( Eov(Jfe-«,„) <i<l<k 
2A n E( £ 



0<i<fc— iu, 



Since V is bounded, flA~5j ) applied with £' n and with a n = E(||£ n - £|| 2 ) yields D(r)]J < 
KA n a n for r = 1, 4 (recall f > 1). Next, one has D(r)^ < K A n a n w n for r = 2 and also for 
r = 5 by applying (A. 5) again, whereas -D(6)jJ < i^A n a n w; 2 . Finally, since E(^ n | 7i k ) = ^ n 
when i < k — w n , one has D(3)^ < KA n a n (l + (k — w n ) + ) and one can rewrite D(7)^ as 

Dil)l 



and another application of (A. 5) yield -D(7)J! < KA n a n (l + (k — w n y). Putting all these 
estimates together (for k = and for k = m n , and since w n > 1) gives us 

E((M^ - M n m i) 2 ) < KA n a n {w 2 n + [t/A n ]). 



We have a n — > and A n w 2 — > by hypothesis, and (IA.15 ) follows. 



4) In view of (A. 13), (A. 14) and (A. 15) it remains to prove the J^-stable convergence 
of the variables M? t ,^ , +w _ u , +1 , and we actually prove a stronger result. Namely, we will 
show the J^-stable convergence of the processes M™ t ^ A , to a process H which conditionally 
on is a centered continuous Gaussian martingale with covariance given by (A.ll) for 
any t > (since A n (w n — u' n + 1) — > this implies the convergence of M™ t ^ A _ u , +1 
toward Hf). 

As in Step 3, Mf = Y! m =i C> where 



i=0 



and each is a martingale increment. Hence, if 



E(C J C' fe I Wm-i), c(e)» = E(||CH 2 l{||ail> £ } I ^ 



m— 1J) 
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we deduce from Theorems VIII. 3. 22 and VIII. 5. 14 of Jacod and Shiryaev (2003) the J- c 



stable convergence of the processes M™ t ^ j to H, as soon as we have the following two 
properties, for all t, e > 0: 



[t/A„] 

E 

m=l 



[t/A„] 

E 

m=l 



C £ 



(A.16) 



The second one is easy to prove. Indeed, if j3 m = || A,m II > we have || Cm II — Ay/A n /3 m 

for some A > 0: we allow the index i to be negative, so that we can apply the obvious relation 
Pi m = Pi—m,0 ° $ m (f° r all i, m G Z) to obtain j3 m = /3q o 9 m . A priori (3 m could be infinite, 
however /3j )Jn = when i < m — w by (A. 4), so (A. 5) for £ implies that E((/3 m ) 2 ) < K. Then 
by stationarity 

E(c(e) m ) < A 2 A n E(^ l { ^ >e/(Av ^ )} ) = A 2 A n a(e) n 
where a(e) n = E((/3 ) 2 ^0 Q>e/{A ^ )} ) ■ 



Now, a(e) n -4 because E(/3 2 ) < and the second part of (A.16) follows 



5) By virtue of the scjuare-integrability of the d-dimensional variables /3 m — /L^>o Pi,Tn 
are well-defined, square-integrable, and also (3 m = (3 w+ i o Q m ~ w - 1 f or all to > u; + 1 (this is 
wrong when 1 < m < w). In this step, we show that 



WL Pm)\ ^ ^ and if m > w + 1, then E(/3^ /3 m ) = a 



(A.17) 



with a- 7 given by (A. 6). The first estimate follows from ||/3 m || < (3 m = /3q o # m and /3q 6 L 2 . 
For the second property, by polarization it is enough to show it in the one-dimensional case 
d = 1, and so below we omit j,k. The variable /3j m = E(£j | % m ) — E(£j | Tl m —i) is im- 
measurable with vanishing % m -i- con ditional mean, whereas ^jE^i | H m ) = (£jE(£; | 
Hm-i)) ° Then 



E(A,m A : 



E(&A, TO ) =1(1(6-6+1 | ft m )E(& | H m ))+E(E(Ci+i | H m )E(&-£ m I ft m )) 



hence for any L > 2m: 
L L 



J2 E(ft,mW = ^E(E(£o | Hm)E(6+6+i I ^ m ))-^E(E(£ L+1 | ft m )E(6+6+i | Wm)) 



iJ=0 



i=0 



Z=0 



By (A. 5) the Zth summand in the last sum above is smaller in absolute value than K/L l 



always, and than K/L v l v when I > 2m. Since v > 1, by letting L — > oo we obtain that 



X>(E(£ |ftm)E(6 + &+i |« m )) =E(e 2 ) + 2^E(C ^), 



z=l 
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the last equality following from the fact that m > w + 1, hence £o is % m -measurable. The 
right side above is (A. 6) in the one-dimensional case, and thus the last part of (A. 17) holds. 

6) In this step we set c'm = ^(Pm /?m | Hm-i) and prove that 



[t/A n ] 



(A. 18) 



m=l 



Letting rf^ be the mth summand above, we see that rf^ = A n ^ ;>0 rj(i, Z)J^, where 



V(i,l)l = (4 Vf An - Vi m _ 1)An <_ 1)A J m, m Pt 



H 



m—l I 



As seen before, /3j jm = when i < m — w and E 
than K/[i — m) 2v when i > m; hence by (A. 3) and (A. 5) we obtain if i < I 



) is smaller than K always, and 



if i < m — w 

lAy/ A n ((l-m)V\m-l-i\) 



n\v(hi) n j)<{ 



K 
K 



(l-m) v 
lA^A n {l-m) 



(i-m) v (l-m) v 

kja^,. 



if m — w < i < m < I 
if i > m 

if m — w < i < I < m, 



and similar estimates hold when I < i. Since one can always assume v E (1,3/2), in which 
case Yli>i( l A Vi&n)/i v < KA v n ~ l , we get E(|^|) < KA v n , and (|A.18|) follows. 



7) By the previous step, in order to get the first part of (A. 16) we are left to show 

(A.19) 



[t/A„ 



A„ ^ V ( m -l)A n V (m-l)A n d 



in 



m=l 



V'Vfds. 



The left side above can be considered as the integral of the cadlag function s i— > Vi V k with 



respect to the (random) measure F^ k {ds) = A n Ylm=l S c '™ d(m-i)A n (ds), where 8 X stands 
for the delta measure at x, so it is enough to show that F^ k converges in probability to 
the measure a jfc l[o,t]( s ) ds. To this aim, it is is enough to show that 



[i/ A n ] /jk 



s/A n ] 



<t => G n s := A n £ 4 fc 



a jk s 



(A.20) 



m=l 



(this is obvious when k = j, because then F™^ k is a positive measure; when k ^ j it may 
be a signed measure, but with an absolute value dominated by \ (F^'^ + F™' ), so again 



(A.20) is enough). 

We recall that /3 m = (3 w +\oQ 
it is obviously enough to show the convergence (A.20) when the sum starts at m = w + 1. 



m—w—l 



when m > w, implying d m 



rJ Qrn-w+l 
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Then, the ergodic theorem and (A. 17) tell us that G™ converges a.s. (locally uniformly in 
s) to a? s. This completes the proof of (b). 

8) Now we turn to (a). This is basically the same as (b), with the processes V 3 being 
identically equal to 1, and with the convention A n = l/u n (indeed, in this case, the calendar 



time and the observation times T(n,i) play no role at all, and neither does J-^; so (2.4) is 
irrelevant, and we can set A n = l/u n ). So all Steps 1-7 can be reproduced, except Step 6 
which is irrelevant, whereas in Step 7 we can proceed directly to (A.20). □ 



We will also need bounds for the moments of the processes G n and H n : 



Lemma A. 2 Under (A. 4 ) and ifV is bounded, we have 



(NO-1) 
(NO-2) 



E(||G?II 2 ) < K{l + w n )t 
E(||fff|| 2 ) < K{l + w n )t. 



Proof. Upon setting = 1 and u n = 1/A n , the case (NO-1) reduces to the case (NO-2). 
By singling out each component H n ^ we can assume d = 1. Then we have E(|.£f™| 2 ) = 
4>n(t) + ipn(t), where 



(i+w n )A{[t/A n ]-u> n ) 



0<i<i+iOi 



,<l<[t/A n ]-u' n 



On the one hand, since V is bounded and (A. 4) holds, |</> n (£)| < K(l + w n )t follows 
from the Cauchy-Schwarz inequality. On the other hand, is ^j +u , n -measurable, so by 



conditioning first with respect to J 7 ^ we deduce from (Ag that \E(V iAn gV lAn tf)\ < 
Kj(l — i — w n ) v when I > i + w n . Since v > 1 one deduces ip n (t) < Kt, and the result 
follows. □ 



Finally, we need to consider processes that are slightly more general than H n , at least 
in the one-dimensional case. Namely, we assume (A. 4) with d = 1, and we are also given 
another set (£ /n , w' n ) satisfying (A. 4) as well (we do not need a limit £' here), plus an arbitrary 



sequence of integers p n > 1. With the same auxiliary bounded process V and sequence of 
integers u' n as above, we set 



[*/An]-< 

i=0 



(A.21) 



Lemma A. 3 In the above setting, and under (NO-2), we have 

t 

A n p. 



IVr. 



E(\H?\ 2 ) < K(l + w' n + -^ + 



;2l< 
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Proof. Set i'( n = ^^ n +w n + Pn - B y P^f and a PP lied repeatedly, we check that for 

I > 0, 



E(Ce^i^oo)|< 



K/(l - w' n ) v if <</<< + p„ 

K/p v n if W n + p n < I <w' n + p n + W ri 

K(l/p 2 n v + l/(/ -W' n -W n - PnY) if / > < + p„ + W n . 



Since V is bounded, we have 



[t/A n ] [t/A n ] 

E(\H?\ 2 ) <KA n Yl £ E (l E (CO I ^oo)|). 
t=0 Z=0 

Then by splitting the sum over I according to the four cases described above, we obtain the 
result. □ 



A. 2 Further Auxiliary Results 

In this subsection we gather a few results of a technical character, to be used at several 
places. 

1) The first of these results is about asymptotically negligible triangular arrays. The setting 
is as follows: for each n we have a discrete-time filtration ('%f)i>o> an integer w n > 1 
(typically, w n —> oo), and a sequence (5")j>i of random variables. 

Lemma A. 4 In the above setting, and if further each 8f is T~Lf +w -measurable, we have 



®(™Ps<t\EUi n] 6?\) < K(^ + 2^ 
where a n = sup^ E(| E(<5f | «?)|), a' n = sup^j E(|5™| 2 ). 



(A.22) 



Proof. When w n = we have a n = sup,^ E(|£™|) and the result is obvious. When w n > 1 
we let 5[ n = E(6? \ Hf) and 5'( n = 6? - 5[ n and, for j = !,■■■ ,w n , 



[(t-jA„)/(w„A n )] 
i=0 

The summands above are martingale increments, relative to the filtration (Hj'+i Wn )i>o, hence 
by Doob's inequality 

E(sup |A(i)«| 2 ) < 4 *^. 
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Observing that A? = ^fif " ] <5? satisfies that A? = X) -=f " ] <5f + Ej=l ^0")?. we deduce 
that 

[*/An] u>„ 



E sup K| < ^ E(|CI)+E E ( su Pl A W"l 
i=l j=l 

and the result readily follows. □ 



2) Our second auxiliary result mainly compares R(k n ,j) defined in (3.14) with R(j) in (3.2). 



Lemma A. 5 7/j G J we have \R(k n ;j) — R(j)\ < K/k^ (for a constant K depending on'}), 

P 

hence in particular R{k n ;'}) — > R(i), as n — > oo. We also have Xu n — ^ f or an D sequence 
u n of integers. 

Proof. Letting j = (ji, • • • ,j q ) G and fi = /u(j), and denoting by Q the set of all non- 



empty subsets Q of {1, • • • ,q}, the complement of Q being denoted as Q c . (3.2) and (3.14) 
yield 

R(k n ;i)-R(j) = ^(-1)I«I<MQ), where 0„(Q)=e( [] X jr II ^+(2r-i)fc„) . 

where \Q\ denotes the cardinal of Q. We fix Q G Q and let ro = maxQ and Q' = Q\{r$}. 
Then 

<f>n(Q)=H*n(Q)x; +i 2r -l)k n ), where $ n (Q) = [] X > II ^+(2r-i) fc „- 

reQ c reQ' 

The variable 3>n(Q) is ^ t+ 2(r -i)fcn" measura ^ e ' with E($ n (Q) 2 ) < i^, so by the Cauchy- 
Schwarz inequality 

\MQ)\ < K^/E(\E(x; +{2r ^ 1)kn \G» +{ 2r -2)k n )\ 2 ). 
Since X^ + 2(r -i)k ^ s ^ /i+ ^ 2ro_1 ^"- meas urable, centered, and with a second moment bounded 



in n, it follows from (A.l) that |0 n (Q)| < K/k^- Summing up over all Q G Q, we deduce 
the first claim. 

Finally we observe that E((x™ n ) 2 ) is independent of u n and equal to p- Ylo<i j<k n r {^~ j)i 
which in turn is smaller than ^- XmeZ l r ( m )l < K/k n , and the last claim follows. □ 

3) For our last auxiliary result we suppose (SNO-2) and consider j = (ji,-- - ,j q ) and 
j' = (fi, ■ ■ ■ ,f q >) in J, and set fx = //(j), // = fi(j'), fi" = fi + fi' and g" = q + g 7 , and also 
a n (t) = N n (t) + 1- fi - 2qk n and a' n (t) = N n (t) + 1 - ft" - (2q" + l)k n . The following 
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processes are the same as U(j) n and f7(j,j') n , when there is only noise and the process 7 is 
properly "frozen": 

a n (t) q 

W(j)?= £ liA n n(Xi+>-X-V M+ (2r-l)fc») 
i=0 r=l 



W(j,j)"= £ 7*A* 11 (Xi+ir - X?+^+(2r-l)feJ II (Xi+,tt+(2g+l)fc„+# ~ X™ 
i=0 r=l r=l 



i+/i"+(2r+2g)fc„) 

(A.23) 



Lemma A. 6 Under (SH) and (SNO-2) we have 
E( S up s < t |C/(j)£-«(j)£ 



/or any p > 1, where K p depends on p, and on j, j' through q, q' only. 

Proof. Since t/(j) n = U(j,$) n and U(j) n = U(j,$) n (with the convention that an empty 



product is equal to 1, in (A.23) for example), only the second claim needs to be proved. 



1) The first step is devoted to some estimates. Set for u,l,w G N: 



C(i; u, l)f = X( i+u)An - xf +l , 

C(3; U, 1)2 = -i Em=0 1 (7(,+I+m)A„ - liA n )Xi+l+r 



C(2;«,0? = (7(i+u)A„ - 7iA„)Xi+« 
C(4; u, I)? = 7iA„ (Xi+u - Xf+z) 



(note that ("(2; u, Z)™ does not depend on I and £(3; u, l)f does not depend on u). Upon using 



the second part of (A. 3) with V = X or with V = 7, plus the independence of J-oo and 



Q and the fact that Xi has moments of all orders, plus Holder's inequality, we get for any 
P > 2: 

E(\((l;u,l)f\P \ T iAn ) <K p A n (u + l + k n ), E(\((2; u, I)? |* | T iAn ) <K p A n u 
E(\((3;u,l)?\r\T iA j <K p A n (l + k n ), E(|C(4;u,f)? \ p I ^aJ < K p . 

(A.25) 

One also has the following: 

j = 1,2,3 => e(|e(c(j;«,0?I ^a„®<?)| 2 ) <^(« + / + g 2 , (a.26) 

which we prove for j = 3 only, the cases j = 1,2 being similar (and even simpler). Indeed, 



(A. 3) again and the independence of J-oo and Q yield 



h n 1 



E(C(3; u, 1)2 I T iAn ® G) I < KAn [ l : + 53 l^+ml 



m=0 



The moments of x being finite, one deduces the second part of (A.26). 
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2) By definition, U(j, j')? - U(j, j')? = Efi^tf, where 



9 _ ?' _ 

£f = EI ( Y i+j r ~ ^i+jti+(2r-l)fcj n (^+^+(2g+l)fc n +i; ~ ^I+/i" + (2r+2g)fc n ^ 



-7ik? n(x i+jV ^i+u+(2i — l)k n 

) n (xh 

r=l r=l 



)• 



We will rewrite this in a more convenient way. If 

1 < r < q => u?=j r , I? = n + (2r - 1) 

q<r <q" => u? = M+(2<Z+ £ = m" + 2r£; n , 



we have 



— _ ^i+z?) _ c(4; i")". 



r=l 



r=l 



Since Y^ u — Y™ +1 = Ei=l CO j M ' 0?) ^ follows that, with Q denoting the set of all partitions 
Q = (Qi,Q2,Q3,Qi) of {I,--- ,<?"} such that Qi UQ 2 UQ 3 + 0, 

4 

c = e where ^(«)" = n^^')? and ^i.i)? = n c(j;<,o- 



In particular, 



s<t 



E{sup\U(j,i'K-U(P,i')«\) < J2 N (Q)t, where iV(Q)? = E(sup| £>(Q)? 

QeQ s -* 



i=0 



(A.27) 



3) We now evaluate N(Q)™, starting with the case where Q is such that, among the 
three sets Qi,Q2,Q3, a single one, say Qj, is a singleton, the other two being empty. We 
then have Qj = {r} for some r £ {1, • • • , q"} and »;(Q)J* = C(j; <, )i »7(Q4, 4)? . With the 
variables 5" = rj(Q)f and the filtration T~Lf = J~iA„ ® {5j+«"+(2<j"+i)£: n i aR d by (A. 25) and 
(A. 26), Holder's inequality, and the fact that r](Q^,4)^ is "%^-measurable, we see that the 
numbers a n and a' n of (A. 22) satisfy for any p > 1: 

a n < + fjf')A n , a' n < K p {{k n + /x")An) 1/p 



(with .fT, depending on q"). One can apply Lemma A. 4 with w n = fj," + (2q" + l)fe n to 
get 



— 1+P 



1-p 

2p ^ 



(recall that a n (f) < t/A n ). Note that by (3.3), 



1 p 
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for all sufficiently large n. 

In all other cases of Q G Q, there are at least two distinct integers r and r' in {1, • • • , q"} 
such that r £ Qj and r' G Qj/, with j,j' < 3 (we may have j = j'). Then rj(Q)" = 



CO'; CO''; *#> W Cf , and by (A.25) and Holder's inequality we obtain E(|ry(Q)f |) < 



#p ((Jfen + M") A n) 1/? ' for all p > I (with ET p again depending on q"). Then in this case 
N(Q)? < K p t(k n + M ") 1/P An /p_1 . 



These two estimates on N(Q)™, according to the case, plus (A. 27), imply the second 

□ 



part of (A. 24). 



A.3 Proof of the Results of Section [3] under (NO-2). 

We begin the proof of the results of Section [3] with the case of (NO-2), and as written before 
we can assume the strengthened versions (SH) and (SNO-2) of our assumptions. We have 
N n (t) = [t/A n ] in this case. 



The general idea is to reduce the problem to an application of Theorem A.l We fix an 
arbitrary finite subset Jq of J + , and if d = j^J§ we associate the following variables £ n 
and C and the process V, whose components are, when j = (ji, • • • ? J<?(j)) : 



c 



"■J 



n 

r=l 



{Xj r ~ ^(j)+(2r-l)fcj - R(kn,i), C J 



90) 

YlXjr 

r=l 



7 t 9(j) (A.28) 



By (SNO-2) and Lemma A. 5 these variables satisfy (A. 4 ) with w n = sup Jg j- (n(3)+2q(j)k 



1) and w = supj e j Q Note that w^A n — > by (3.3). We also write u' n = 2q(j)k n + 



/x(j) — 1, hence u' n A n — > as well. 

With Z' n , H n and U{]) n given respectively by ( |3.17| ), flA.10[ ) and ( |A.23[ ), a simple cal- 
culation shows that 



Z' t n j=H?j+ZliMhj)l where 

A(2,m = ^ (An ES tK< 7S " Jo 7? 0) d«) 
vl(3,j) 



(A.29) 



/An 



Since U = 7 9 ^) is a semimartingale satisfying (SH), we have that rjf = -7^= JiA ^^"(^s 



OiAjds satisfies |E(^ | J^aJI < and E((r ? ™) 2 | T iAn )\ < KA 2 n , by (AJj). This, 

the boundedness of 74, Doob's inequality for the discrete-time martingale X^i=o( r ?r — ^(^f I 
FiAn)) and the Cauchy-Schwarz inequality yield 



E( sup \A(2,j] 

s<t 



(A.30) 
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(the last term in the right being due to R< ^^ f^ N (t)-u' )A 7* ^ s )' Finally, we deduce from 
Lemmas A. 5 and A. 6 and from the boundedness of 7^ and the fact that E(||£ n || 2 ) < K, that 
for any p > 1: 



E(sup s < t |A(l,j)?|) < K p (t + V~t){k n A n /2 + k n /p A l J p 1/2 ) 

„„„ I Mi i\n\ ^ Kt 



su Ps<t |A(3,j)?| < 



(A.31) 



kV, A 



1/2- 



Now, we can proceed to the proof of the various results. 



have V A n Z'^ 



Theorem 



A.l 



0. By (A.29 



Proof of (b) of Theorem 3.1 For (3.7), it is enough to check that for all j 6 Jo we 

this amounts to have \J A n Hl^ 3 — > 0, which follows from 

P 

— > for I = 1,2,3. The latter is an obvious consequence 

□ 



and y/E^A(l,j)% 



of (A.30)-(A.31), plus &3.31. 



Proof of Theorem 



3.5 



The covariance a? k of (A.6) is denoted by Y)^ , and a simple 



calculation shows that it is given by (3.8). Thus the M 1 ^ -valued limit Z' T in Theorem 



3.5 



is exactly the limit Ht in Theorem A.l as given by (A. 12). Therefore, in view of (A.29), 
it is enough to prove that A(l,j)j, — > for I = 1,2,3 and each j G Jq. This is an obvious 



consequence of (A. 30) and (A.31), upon takings = 2(1 — 6) for the latter, plus the property 
(3.15), which yields in particular p > 1. □ 



Proof of Theorem 



3.2 



We have j = (ji, • • • ,j q ) and j' = 



,j' q >) in J + , and we 



associate the notation //, //', q", a' n {t) as before (A. 23). Instead of (A. 28) we consider two 
one-dimensional variables: 



r=l 



r=l 



Xj£'+(2r- l)fc„) - R (kn^,}')■ 
2q'k n — 1. Then we 



So £ n and £ /n satisfy (A.4), with w n = /i + 2qk n — 1 and w' n = fi 
associate H n by ( A. 21 ), with V t = 7^ and p n = k n + 1 and u n = \x + (2q + l)k n — 1. 



Similar with (A.29), we have 



A„F(j, j)')? - R(j)Rtf) Ill's' ds = V^Hf + £li A(Z)?, where 
A(l)? = R(k nj ]) ■ A n £fi A " ] -< jgjf + R(k n ;i') ■ A n YXLo^ 7,lC n+p „, 
A(2)? = A n (C/(j,jO t n -W(j,jO?) 

A(3)? = WAwj) flfojj')) (AnEfio Anl_< 7£ n - JoSf^ 
A(4)? = j) i?(fc n ; j') - /* 7s 9 " da. 



Applying Lemma A. 2 to H n defined by ( |A.10| ) with := 7? A and noticing that both 

R(k n ;j) and i?(& n ;j') are bounded, we see that £7|A(1)"| < K\J 'A n \fk n t. Moreover, using 
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again the boundedness of R(k n ;j) and R(h n ;j), and if we combine Lemmas A. 3 A. 5 and 



A. 6 (the latter with p = 2) and also (A. 30), we obtain, as soon as //" < 2k n and k n A n < 1: 



E 



A n U(i,i'Yt- Rti)Rtf) I 7l ds 

o 



< K(l + t + V~t)(Vk n A n + — 



(A.32) 



The right-hand side converges to as n — > oo under (3.3), and the proof is complete. □ 



Proof of (b) of Corollary 3.3 We use the same notation as in the previous proof. 



The definitions of Z^' J , E^ and E'j'j ,n , plus the boundedness of jt, yield, with B\ 



AnC/(j,j')? " m)m')Io 7f ^ and r n = £ meZ:H>Jfc , ® j'+J " R(jW)\- 



gjj'.-.Bij' / 7 f ds <^r n +Ay 2 (|zr Jej 'i+E(i z : 

v m=1 



l+l# 



Lemma 



A.2 



+(2*4+1)5?. 
(A.33) 

yields E(|i?^ J "|) < K^tjk~ n , uniformly in j" = j © j' +m or j" = j +m © j' when 



m < k n . Then we apply ( ]A.29| ), ( ]A.30[ ), ( |A.31[ ) with p = 2, and ( |A.32[ ) to get 



£•>">" / 7 f da 



< if (1 + t) (r n + k' n (^/Kh~ n + 1/A£ 



On the one hand, |-R(j©j +m ) — i?(j)i?(j')| < if/(|m|— //')" as soon as |m| > hence r n — > 0. 



On the other hand, (3.11) yields k' n (\/k n A n + — > 0, and the proof is complete. □ 



A. 4 Proof of the Results of Section [3] under (NO-1). 

Now we turn to the results under (NO-1), and without loss of generality we can and will 
assume (SH) and (SNO-1). There is no process 7 here, but the observation times T(n,i) 
are (possibly) random. We also fix the horizon T > 0. 



We consider a finite subset Jq C J + , and we use the notation (A. 28), and also the 



processes G n = (G n ' } )j eJo defined by (A. 9), with u n = N n (T) and u' n = + 2q(j)k n - 1 



and with the variables £ n = (£ n ' J ) given by (A. 28) and the corresponding £ and w n 



Lemma A. 7 For any fixed T, the variables G™ converge Too-stably in law to a centered 
Gaussian M.^ -valued variable G = (G^)j^j independent of J 7 ^ and whose covariance matrix 



is E JJ , as defined by (3.8). 



In other words, the limit G is the value at time 1 of the process B of (A.8), with the 
matrix a = E. 



Proof. The key point here is the independence between the noise and the cr-field T^. With 
B as above, we need to prove that, for any bounded J^-measurable variable $ and any 
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bounded function / on which is continuous for the product topology, we have 

E($/(G?)) -»• E(*)E(/(B0). (A.34) 



In fact, (a) of Theorem A.l can be applied to the (random, J^-measurable, and going to 



oo) sequence u n = N n (T). We get that 



G n — > B\, conditionally on F a 



which in turn yields (A.34) in a straightforward manner. 



□ 



Form now on, we basically reproduce the arguments of the previous subsection. We 



need to compare the variables ZJp of (|3. 161) and the variables defined, above. If — 
N n (T) + 1 - /x(j) - 2qk n and j = (ji, ■ ■ ■ ,j q ), we observe that 

Z ^ = G^ +A(i) n + A(j)' n , where 
A(j) n = ^/NjT) (R(k n ;i)-R(i)) 

A '^ n = ^ N n (T) ^'i=0™ (n?=l(^+j r ~~ Yi+nQ)+(2r-i)k n ) ~ ]lr=l (Xi+j r ~ X™ +A1 (j) + ( 2r -l)fc n 

(A.35) 

On the one hand, Lemma A. 5 and the fact that T/A < A n N n (T) < At (with A and At 
constant) yield 

\A(i) n \ < K/(kl^/K n ). (A.36) 



On the other hand, we have the following lemma, similar to Lemma A. 6 



Lemma A. 8 For any p > 1, and with K p depending on p, and on j through q(j) only, we 
have 

E(|A'(j)„|) < K v ^A~ n (k n + + (k n + A^ 1 ). 
Proof. With j = (ji, • • • ,j q ) and (i = /i(j), we set 

C( r )i = X T{n,i+j r ) - ^T(n,t+ / Lt+(2r-l)fc„)' (' ( r )i = Xi+jr ~ Xi+/j,+(2r- l)fe„" 

Let Q be the set of all non-empty subsets Q of {1, ■ • • , q}, the complement of which being 
denoted as Q c . We have - F? +/i+(2r _ 1)fcn = ((r)? + ('(r)?, hence 

* (fin = ErJ" ff> where = £ QeQ V (Q)f V '(Q)f and 

rim = Ure Q c«?> = nw C'(r)?. 

Using T/A < A n N n {T) < At once more, we see that 

a(j)„ [t/A n ] 

XXqwcq)? ^ su p mot. where #(Q)? = E vmr/m, 
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we see that 



i=0 



< 



; l ; ,/a; sup \n(q)?\. 

1 t<A T 



(A.37) 



Note that C'i r )? is ^(4; j r , + (2r — l)k n )f of the proof of Lemma A. 6 with 7 = 1. As for 
C( r )f, it is the same as £(l;j r ,fj,+ (2r — l)fc n )™, except that the T(n,i) are stopping times. 
However, since T(n,i + m) — T(n,i) < mAA n by (SNO-1), the estimate (A. 25) and (A. 26) 
are still valid here for Ci r )2> with = {7j+j+2<jfc n V^T(n,i) here. As to N(Q) n , it is exactly 



the same here and in (A.27). Henceforth, exactly as in this lemma, we obtain the desired 
estimate. □ 



Proof of (a) of Theorem 3.1 For (3.6), it is enough to check that Z^/y/N n (T) 0, 
whereas N n (T) X 1/A n . This amounts to having \J A n G™' J — > 0, which fo llows from 

p p 

and \/ A n A(j) n — )• and \/A n ^4 (j) n — ^ 0, which follow from (A. 36) and 

□ 



Lemma 



A.7 



Lemma IA.8I 



This follows from Lemma A.7 provided A(j) 



and 



Proof of Theorem 3.4 

F 

A (j)n — * 0- Under (3.3), these two properties in turn follow from (A. 36) and Lemma 
A8|with p = 2(1 - 6), under (13.151). □ 



3.3 



Proof of (a) of Corollary 

ness of and R(j') yield, similar with (A. 33): 

i 



The definitions of Z^, and ' n and the bounded- 



SJ'J I < r n + 



+ 



+ 1^ 



\/JV„(T) 



(A.38) 



By Lemma A.2, E(|G" J |) < i^VA^. By (A.35), (A. 36) and Lemma A.8, and setting 6 n 



kl /2 + l/(klAn rz ), we conclude E(|Z£ J |) < #<5„ if /i(j) < 2A; n . Therefore, since N n (T) > 



T/(AA n ), from (A.38) and the already proven fact that r n — > (because A4 — )• oo), plus 
the convergence in law of Z^, we deduce that E J ' J ' n — > S J,J as soon as k' n < k n and 
A„ — )• 0. These are implied by (3.11), and the proof is complete. □ 



k n 5 n 
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